Journal: NAR Genomics and Bioinformatics
Article Title: Distribution of specific prokaryotic immune systems correlates with host optimal growth temperature
doi: 10.1093/nargab/lqae105
Figure Lengend Snippet: Distribution of immune systems in different T opt categories. ( A ) Definition of T opt categories and their bacterial and archaeal coverage in the SR-GTDB. ( B ) Distribution of genomes in the SR-GTDB for different T opt categories and phyla, with averages indicated. Genomes from phyla with < 30 accessions are grouped as ‘other bacteria’ and ‘other archaea’. ( C ) Distribution of the total number of immune systems in bacteria in the SR-GTDB per T opt category, represented by a raincloud plot (visualizing both individual data points and the probability density of the data) and a boxplot. The median is indicated in the boxplot, while the average (Av) is shown on the right. ( D ) Distribution of bacterial genome sizes for different T opt categories, represented by raincloud plots (visualizing both individual data points and the probability density of the data) and a boxplot. The median is indicated in the boxplot, while the average (Av) is shown on the right. ( E ) Percentage of bacterial genomes encoding specific immune system families (when found in ≥ 150 genomes) for different T opt categories. *, ** and *** indicate P -values < 1.43 × 10 −3 , 1.43 × 10 −4 and 1.43 × 10 −5 . P -values are determined by a χ 2 test of independence; cut-off values are determined by a Bonferroni test of α = 0.05 / number of comparisons. ( F ) Proportional distribution of different immune system categories for different T opt categories in bacteria. ( G ) Percentage of bacterial genomes with specific RM, CRISPR-Cas and pAgo subtypes identified (when found in ≥ 150 genomes) for different T opt categories. For RM, *, ** and *** indicate P -values 1.43 × 10 −3 , 1.43 × 10 −4 and 1.43 × 10 −5 . For CRISPR-Cas, *, ** and *** indicate P -values < 3.85 × 10 −3 , 3.85 × 10 −4 and 3.85 ×10 −5 . For pAgos, *, ** and *** indicate P -values < 1 × 10 −2 , 1 × 10 −3 and 1 × 10 −4 . P -values are determined by a χ 2 test of independence; cut-off values are determined by a Bonferroni test of α = 0.05 / number of comparisons.
Article Snippet: Bar graphs, scatter plots, pie charts and heat maps were made using Microsoft Excel and Adobe Illustrator, with the exception of the Sankey diagram [made using SankeyMATIC ( https://sankeymatic.com/ )], raincloud and bee-swarm plots (made using python package Plotly version 5.14.0) and the Venn diagram (made using the function venn2 of the python package matplotlib-venn version 0.11.7).
Techniques: Bacteria, CRISPR